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METRIC ENTROPY OF HOMOGENEOUS SPACES 


STANISLAW J. SZAREK 

Abstract. For a (compact) subset AT of a metric space and £ > 0, the cover¬ 
ing number N {K, e) is defined as the smallest number of balls of radius e whose 
union covers K. Knowledge of the metric entropy, i.e., the asymptotic behaviour 
of covering numbers for (families of) metric spaces is important in many areas of 
mathematics (geometry, functional analysis, probability, coding theory, to name a 
few). In this paper we give asymptotically correct estimates for covering numbers 
for a large class of homogeneous spaces of unitary (or orthogonal) groups with re¬ 
spect to some natural metrics, most notably the one induced by the operator norm. 
This generalizes earlier author’s results concerning covering numbers of Grassmann 
manifolds; the generalization is motivated by applications to noncommutative prob¬ 
ability and operator algebras. In the process we give a characterization of geodesics 
in U{n) (or SO{m)) for a class of non-Riemannian metric structures. 


1. Introduction 

If {M,p) is a metric space, A C M a compact subset and e > 0, the covering 
number N{K, e) = N{K, p, e) is defined as the smallest number of balls of radius £ 
whose union covers K. If A is a ball of radius A in a normed space of real dimension 
d, it is easily shown (by a volume comparison argument) that, for any £ G (0, R], 

{R/ef < N{K, £)<(! + 2R/eY. (1) 

The lower and the upper estimate in (1) differ roughly by a factor of 2“^, and for 
many applications such an accuracy is sufficient. On the other hand, determining 
more precise asymptotics for covering numbers and their ’’cousins”, packing numbers 
(see section 2), e.g. for Euclidean balls is a nontrivial proposition (and a major 
industry). In this paper we attempt to obtain estimates of type (1) for homogenous 
spaces of the orthogonal group SO{n) or the unitary group U{n) (like e.g. the 
Grassmann manifold Gn,k or the Stieffel manifold, equipped with some natural metric; 
we admit metrics induced by unitary ideal norms of matrices, most notably the 
operator norm). A typical result will be: if M is a “nice” homogenous space of 
SO{n) or U{n) and e G (0,d(M)] (where 6{M) is some computable “characteristic” 
of M, in the more “regular” cases 6{M) ^ diamM), then 
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{cdiamM/eY < N{M,e) < (C diamM/eY 


( 2 ) 


where d is the (real) dimension of M and c and C are constants independent of £ 
and (largely) of M. Of course universality of the constants in question is the crucial 
point. 

We note in passing that (again, by a standard “volume comparison” argument) (2) 
is equivalent to the (normalized) Haar measure of a ball of radius e being between 
{cie/diam and {Cie/diam , where Ci,Ci > 0 are universal constants 

related to c, C. We also point out that the underlying metric being typically non- 
Riemannian, the methods of Riemannian geometry do not directly apply. 

This paper is an elaboration of the note by the author, where (2) was proved 
in the special case of M being SO{n), U{n) or a Grassmann manifold Gn,k- (An¬ 
other argument was given later in [^.) The original motivation and application was 
the hnite-dimensional basis problem; more precisely, (2) was used in the proof of 
(Theorem 1.1 in [M): 


There is a constant c > 0 such that, for every positive integer n, there exists an 
n-dimensional normed space B such that, for every projection P on B with, say, 
.Oln < rankP < .99n, we have ||P ; P —>■ i?|| > c^/n. 


The results of []I^ were subsequently applied to other problems in convexity, local 
theory of Banach spaces, operator theory, noncommutative probability and operator 
algebras (cf. e.g. 0, [|^], |^, |^). It turned out recently (see |^) that some 
questions from the last two helds lead naturally to queries about validity of estimates 
of type (2) in settings more general than that of []^ (which, additionally, had a rather 
limited circulation). It is the purpose of this paper to provide a reasonably general 
answer to such questions. However, describing asymptotics for completely general 
homogenous spaces of SO{m) or U{n) is, in all likelihood, hopeless. In this paper we 
cover a number of special cases, including those that have been explicitly inquired 
about. We identify (easily computable) invariants relevant to the problem and provide 
“tricks” that could be potentially useful to handle cases not addressed here. Ecxept 
for brief comments here and there, we restrict our attention to M = G/H, where 
G = SO{m) or U{n), and H - a connected Lie subgroup of G; but clearly most of 
our analysis can be extended to other compact linear Lie groups. 

The organization of the paper is as follows. 

Section 2 explains notation and presents various preliminary results concerning 
covering numbers and their relatives, unitarily invariant norms and the exponen¬ 
tial map. Most of these are known or probably known. In section 3 we show (in 
Proposition 6) that cosets of one parameter semigroups are the geodesics in U{n) 
(or SO{m)) endowed with an intrinsic metric induced by a unitarily invariant norm; 
this result could be of independent interest. In section 4 we discuss several simple 
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examples that point out possible obstructions to estimates of type (2) and suggest 
invariants mentioned above. We then use the results of the preceding two sections to 
show estimates of type (2) for an abstract class of homogeneous spaces that contains 
t/(n), SO{n) and Gn,k (Theorem 8). In section 5 we discuss various possibilities for 
relaxation of assumptions from section 4, in particular we cover the special cases 
motivated by applications. At the end of section 5 we briefly address the issue of 
extending the results to metrics generated by unitarily invariant norms other than 
the operator norm. 

Acknowledgement. The author would like to express his gratitude to D. Voiculescu, 
whose encouragement was instrumental both in the inception and the completion of 
this work. The hnal part of the research has been performed while the author was 
in residence at MSRI Berkeley; thanks are due to the staff of the institute and the 
organizers of the Convex Geometry semester for their hospitality and support. 

2. Notation and Preliminaries 

We start with several remarks clarifying the relationship between covering numbers, 
packing numbers and their slightly different versions that exist on the market. First, 
since the centers of balls in the dehnition of N{K, e) , as given in the introduction, do 
not necessarily need to be in K, the exact value of N{K, ■) may depend on the ambient 
metric space (M, p) containing K. Accordingly, it is sometimes more convenient to 
allow sets of diameter < 2e in place of balls of radius S] call the resulting the quantity 
N'{K,e). If the centers of the balls in the dehnition are required to be in K, call 
the quantity N"{K,e). Finally, let the packing number N{K,e) be dehned as the 
maximal cardinality of an e-separated (i.e. p{x, x') > e if x ^ x') set in K. The 
quantities N, N' and N are related as follows 


N'{;e) < N{;e) < N'\;e) < N{;e) < N'{;e/2). (3) 


Consequently, for our “asymptotic” results the four quantities are essentially inter¬ 
changeable. 

If the metric space (M, p) is actually a normed space with a norm || ■ || and unit ball 
B, we may write N{K, || ■ ||,e) or N{K,B,e) instead of N{K,e) or N{K,p,e). The 
technology for estimating covering/packing numbers of subsets (particularly convex 
subsets) of normed spaces is quite well-developed and frequently rather sophisticated 
(see i. H). We quote here a simple well-known result that expresses in 

terms of a “volume ratio”, and of which (1) is a special case. 


Lemma 1. Let K, B (Z with B convex symmetric. Then, for any e > 0, 

1 < N{K,B,b) < N"(K,B,e) < N\K,B,eI2) < (Iy'BTLZiLRI 

e vol B e 


vol B 
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The next lemma is just an observation which expresses the fact that the cover¬ 
ing/ packing numbers are invariants of Lipschitz maps. 

Lemma 2: Let {M,p) and (Mi,pi) be metrie spaees, K C M, ^ : K ^ Mi, and 
let L > 0. If $ verifies 

pi{^{x),^y)) < Lp{x,y) for x,y e K 

(i.e. ^ is a Lipschitz map with constant L), then, for every e > 0, 

N'\^{K),pi,Le)<N'\K,p,e). 

Moreover, N" can he replaced by N' or N and, if <h can be extended to a function 
on M that is still Lipschitz with constant L, also by N. 


We now turn to our main interest, the unitary group U{n), the (special) orthogonal 
group SO{n) and their homogeneous spaces. (As 0(n) is geometrically a disjoint 
union of two copies of SO{n), all statements about SO{n) will easily transfer to 
0(n).) Throughout the paper we will reserve the letter G to denote, depending on 
the context, U{n) or SO{n). Similarly, we will reserve the letter Q to denote the Lie 
algebra of G, the space u{n) or so{n) of skew-symmetric matrices. Since G and Q are 
subsets of M{n) (the algebra of n x n matrices, real or complex as appropriate), they 
inherit various metric structures from the latter. In this paper we focus on the one 
induced by the operator norm (as an operator on the Euclidean space, that is), but 
will also consider the Schatten Cp-norms ||a;||p = {tr with the operator norm 

II ■ Hop = II ■ ||oo being the limit case. We will use the same notation ||a;||p for the 
norm on R” or C", but this should not lead to confusion. We will also occasionally 
mention other unitarily invariant norms (i.e. verifying ||a;|| = ||Ma;n|| if a; G M{n) 
and u,v E G), each necessarily associated with a symmetric norm on (which we 
will also denote by || ■ ||) via ||a;|| = ||(sfc(a;))^^]^||, where <si(a;),... ,Sn{x) are singular 
numbers of x. 


If p is a metric on G and H C G a closed subgroup, we consider the homogeneous 
space M = G/H of left cosets of R in G as endowed with the canonical quotient 
metric Pm{E,F) = inf{p(M,t;) : u E E,v E E}. A fundamental example is that 
of the Grassmann manifold Gn,k of /c-dimensional subspaces of R"^ (resp. C”): the 
relevant subgroup H of SO{n) (resp. U{n)) consists of matrices of the form 


ui 0 
0 U2 


where Ui E SO{k) and U 2 E SO{n — k) (resp. U{k),U{n — k)) and the identhcation 
of cosets of H with the subspaces is via uH ~ uE^, where E^ is the linear span of 
the first k vectors of the standard basis of R”^ (resp. C”). 

It will be usually convenient to consider the intrinsic metric, which we will again 
call p, on M = Lf{ri) (or SO{n), or the homogeneous space): p{u,v) is the inhmum 
of lengths of curves in M connecting u and v. Since this is going to be relevant 
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later, we observe that the inhmum may be taken over absolutely continuous curves 
7 : [a, 6] —M (the inhmum is then achieved, while any rectihable curve parametrized 
by arc length is absolutely continuous). Then the length 

^(7)= hwmdt, (4) 

Jo 

where || ■ || is the appropriate (unitarily invariant) norm on M{n) ii M = G (otherwise 
II ■ II can be interpreted as a quotient norm on the corresponding quotient of the 
relevant Lie algebra; cf. (14) in section 4 and comments following it). We also point 
out that all the metrics we consider being bi-invariant, any curve in M can be lifted 
(by compactness and elementary properties of the Li-norm) to a “transversal” curve 
in G of the same length. The correct abstract framework for these considerations is 
that of Finsler geometry (see e.g. P|, but the manifolds we consider being canonically 
embedded in normed spaces we can afford to be more “concrete”. 

For future reference we point out that the (operator) norm distance and the cor¬ 
responding intrinsic distance are related as follows 

||u-u|| = |l-e*^(“’^)|. (5) 

This follows from Proposition 6 in the next section; however, here we just wish to 
point out that the two metrics differ by a factor of ti/2 at the most and this particular 
fact is implied by the more or less obvious inequalities p(m, u) > ||m— u|| > |1 — 

Since, by dehnition, the corresponding quotient metrics are distances between cosets, 
the corresponding two metric structure on homogeneous spaces are related in the 
way analogous to (6). Accordingly, estimates of type (2) will transfer easily from one 
metric to the other, and the choice of the one to work with will only be a matter of 
convenience and/or elegance. 

Because of the invariance of the metric under the action of G (= U{n) or 50(n)), 
one can give estimates for the covering numbers of M (analogous to those of Lemma 
1) in terms of the Haar measure of balls (cf. the comment following (2)). However, 
since the dependence of the measure of a ball on the radius is much less transparent 
now than in the “linear” case, such estimates are not necessarily useful. To overcome 
this difficulty we “linearize” the problem via the exponential map (composed with 
the quotient map g : G —> M if necessary) and then use Lemma 2. Since we operate 
in the “classical” context, the exponential map is the standard one 

CXD 

exp a: = = y/ — for a: G M(?7-), 

k=i 

and it will be normally sufficient to consider the restriction of exp to ^ (= u{n) 
or so(n)), the Lie algebra of G (= U{n) or SO{n)). In order to be able to apply 
Lemma 2 we must “understand” <F = g o exp; specihcally we need to know for which 
K (Z Q we have ^{K) = M (or at least when <F(iF) is “large”) and for which K 
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the restriction (resp. is Lipschitz. Concerning the hrst point, it is well 

known that, in our context, exp(^) = G. Moreover, we have 

Lemma 3. Let K = {x ^ Q ; ||a;||oo < tt} be the ball of radius n in Q in the 
operator norm. Then 

(a) exp(i^) = G 

(b) exp is one-to-one on the interior of K. 

The above is a special case of a more general fact for Lie groups, but in the 
present setting can be seen directly from the fact that every unitary matrix can be 
diagonalized, with the argument for SO{n) being just slightly more complicated. 

Lemma 3 asserts that G resembles, in a sense, a ball in Q. However, for our 
purposes we need more quantitative information about exp, which we collect in the 
next lemma. 


Lemma 4. For any unitarily invariant norm and the corresponding metric on G 
(extrinsic on intrinsic), the map exp : Q ^ G is a contraction. 

On the other hand, let || ■ || any unitarily invariant norm and set, for 6* > 0, 

\\x-y\\ 

Then 4>{6) > 0 if 6 <7r. Moreover, if 0 e [0,27r/3), then 

OO 

k=l 

In particular (fiO) > .4: if 6 < 7r/4. 

Proof. The first assertion is classical for the extrinsic (norm) metric and hence 
follows formally for the intrinsic metric. For the other assertions, we observe first 
that since the derivative of the exponential map at 0 is the identity. 



: x,y e g, X ^ y, ||a;||oo < 0, |||/||oo < 0}. 


lim 6(9) = 1. 
e^o+ 


Let x,y be as in the dehnition of 4>{6). We have 
e — e^ = 62 (e 2 — 62 ) + (e 2 — e 2 )e 2 
= 2(62 — 62 ) + (62 — /)(62 — 


y 

62 


) + (e^ 





( 6 ) 


and so, by the ideal property of unitarily invariant norms, 

\\e--ey\\ = ||e!-e?||(2-||ei-/||-||A-/||) 

Iterating and using (7) we obtain the third (and hence the last) assertion of the 
lemma. For the second assertion (0(6^) > 0 if 6^ < tt, not used in the sequel), we just 
briefly sketch the argument for G = U{n). Let 9 G (0,7r) and 5 > 0. Consider hrst 
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the case of the operator norm. We need to show that if A, B are Hermitian with 
spectra contained in [—6*, 6] and ||e®"^ — e^^\\ < 6, then ||y4 — B\\ < C{6)6, where C{6) 
depends only on 9 (and not on A, B or n). By |^, Theorem 13.6, the eigevalues of 
A and B (multiplicities counted) are, ia a certain precise sense, “close”, and so, by 
perturbation, we may assume that they are identical; we may also assume that all 
those eigenvalues are inegral multiples of 8. Let u G U{n) be such that B = uAu~^; 
we need to show that ||e*^M — < 48 implies \\Au — uA\\ < C{6)8 and this 

follows by writing u as a “block matrix” in the spectral subspaces of A. For a general 
unitarily invariant norm we note that the assertion is roughly equivalent to uniform 
boundedness (with respect to the norm in question and with a bound depending only 
on 9) of the inverse of the derivative of the exponential map. That derivative is, in 
a proper orthonormal basis, an antisymmetric “Schur multiplier” (see |^, Theorem 
2.14.3 and its proof). As a consequence, the inverse is also a “Schur multiplier” and 
so its norm with respect to the operator norm equals to the norm on the trace class 
Cl (by duality) and dominates the norm with respect to any unitarily invariant norm 
by interpolation (cf. |0, §28 or 0). 

Remark. In all likelihood, a version of Lemma 4 (and of Lemma 5 that follows) 
should be known, at least for the operator norm, but we couldn’t find a reference. It 
would be nice to have an elegant proof which gives good constants in the full range 
of 9 (g (0,7r)). We point out that the “Schur multiplier” argument indicated above 
provides a simple “functional calculus” proof (with good constants) in the case of the 
Hilbert-Schmidt C' 2 -norm. 

Lemma 5. Let G = U{n) (resp. SO{n)) and p the intrinsic metric on G induced 
by a unitarily invariant norm || ■ || on M{n) . Then, for any x,y G Q, 

p{e’+>,eV)<\\lx,y]l 


Proof. The argument is similar to, but slightly more complicated than that of the 
previous lemma. Denote, for f > 0, 

^P{t) = 

Clearly -0(0) = 0. Moreover, expanding the exponentials and noting that p{u, v)/\\u — 
n|| —*• 1 as ||m— n|| —> 0 (this follows from (6), but can also be seen from the inequalities 
in the paragraph following (6), which do not depend on Proposition 6) we conclude 
that 


lim 

t^o 






( 7 ) 
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Now 


P{ 


e"+^ e"e^) 


, / t 4 _„ ^+y V\ , / ^+y ^ y £. y s. v\ 

< p { e ^+ y,e 2 6262 ) +p(e 2 6262,62626262 ) 

, X y X y X x + y y X x+y y 

+p(62626262,626 2 e2)+p(626 2 62,6^6^) 

/ ^+y £ / y. s. £±K\ 

= 3p(6 2 ,6262) +p(6262,6 2 ) 


Hence ■^(1) < and, by the same argnment, 'ip{t) < 4-0(1) or for 

f > 0. In combination with (8) this implies the lemma. 


3. Some Non-Riemannian Geometry 

Onr last anxiliary resnlt involves non-Riemannian geometry of G = U{n) (or 
SO{n)). It is very well known (in a much more general context) that if G is endowed 
with a bi-invariant Riemannian structure (which is, in our case, the one induced 
by the Hilbert-Schmidt G 2 -norm on M{n)), then the geodesics of G are exactly the 
cosets of one-parameter subgroups (see P, p. 148, Ex. 5, 6). It is not immediately 
clear how general is this phenomenon. Since geodesics are normally dehned via affine 
connections, we should make clear here that we emphasize the “metric” approach: 
a curve in a manifold M endowed with a metric is a geodesic if it locally realizes 
the (intrinsic) distance between points as explained in the paragraph containing (4) 
(the argument can be presumably rewritten by starting from the affine connection 
induced by the group structure, though). We have 

Proposition 6. Let || ■ || be a unitarily invariant norm on M{n) and p the induced 
intrinsic metric on G (= U{n) or SO{n)). Then 

(a) cosets of one parametric semigroups (i.e. curves of the form ■yit) = ue^^, u & G, 
X E Q) are geodesics in {G,p) 

(h) if II ■ II is strictly convex (which happens in particular if || ■ || = || ■ ||p for some 
p G (l,cx))0 then all geodesics are, up to a change of parameter, of the form given in 
(a) (or arcs of curves of such form) 

(c) if, furthermore, the spectrum of u~^v does not contain —1, the curve of shortest 
length (geodesic arc) connecting u and v is unigue. 

Remarks. (1) A unitarily invariant norm on M{n) is strictly convex (i.e. the unit 
sphere {x : ||a;|| = 1} does not contain a segment) iff the associated symmetric norm 
on R" is (cf. [^]). The operator norm and the trace class Gi-norm are not strictly 
convex. 

(2) It is a somewhat delicate issue how smooth should be the curves we consider, 
particularly since the results from |Q, to which we refer quite heavily, do not £t 
precisely our needs exactly in that respect. For the purpose of following the proof 
below, the reader should think of all functions as being at least G^. As indicated in 
the previous section, absolutely continuous functions provide a convenient framework; 
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we comment on the fine points of the present argument and on their relevance to |l| 
at the end of the proof. 

Proof. Since SO{n) is a Lie subgroup of U{n), it is enough to prove the lemma for 
the latter, (a) By unitary invariance of the metric it is enough to prove the assertion 
for “short arcs of one parameter semigroups”, i.e. curves of the form 70 (t) = 
t G [0,1] where ||x|| is “small”. By (5), the length £( 70 ) equals ||x||. We need to show 
that any curve 7 in U{n) connecting 70 (0) = / and 7 o(l) = is of length > ||x||. 
The argument is based on (and in fact very close to) the results on spectral variation 
of unitary matrices presented in |]l|, §13, 14. Indeed, Theorem 14.3 and Remark 14.4 
of |I|, when specified to unitary matrices, say in effect that the map S associating 
to a matrix its spectrum is a contraction when considered as acting from {U{n),p) 
to (C”, II ■ ||)/*S'„, where || ■ || is now the symmetric norm associated to the unitarily 
invariant norm in question (a priori defined on but canonically extendable to 
C^) and Sn is the symmetric group acting on O’* by permuting the coordinates (we 
recall that we are working with the intrinsic metric). This in turn implies that, for 
any curve 7 in U{n) connecting I and e^, we have 

%)>^(S(7)). ( 8 ) 

Since 7 lies in U{n), its image under S lies in T"'/S'„, where T = {z G C : \z\ = 1}. 
Let now E : R"" —T” be the exponential map of the group T” (i.e., E{{^k)k=i) = 
(exp(i^fc))^^^) considered as acting from (R"", || • ||) to T” equipped with the intrinsic 
metric inherited from (C", || • ||). Then E is a. local isometry, e.g. it is an isometry 
when restricted to [— 7 r/ 2 , 7 r/ 2 ]". Consequently, the appropriately restricted induced 
map E : R"'/S'„ — T^/Sn is also an isometry and so, if ||x|| is “small”, the length 
of 7 i = £^“^(S( 7 )) is the same as that of 11 ( 7 ), in particular, by (9), £( 71 ) < ^( 7 ). 

Now observe that 71 connects 0 (g R”) and A = (Ai,..., A„), where (zAi,..., iXn) 
are eigenvalues of x with multiplicities. Clearly ||x|| = ||iA|| = ||A|| and they are all 
equal to the distance between 0 and A in R^/Sn. Accordingly 

||x|| < £( 71 ) (in R750 = ^(S(7)) (in T77) < ^( 7 ) (in {U{n),p)) (9) 

and the assertion (a) is proved. 

(b) Again, it is enough to consider curves of the form 70 (t) = e*^, f G [0,1] where 
||x|| is “small”. We have to show that if 7 is a curve in U{n) connecting I and 
such £( 7 ) = ||x|| = £( 70 ), then (after a change of parameter, if necessary), 7 = 70 . In 
other words, we need to investigate the cases of equality in ( 10 ). 

Concerning the first inequality in (10), we observe that since || ■ || is strictly con¬ 
vex, a straight line segment in (R"", || • ||) with endpoints A, A' is strictly shorter 
than any other curve connecting these points. As a consequence, the same is true 
for the corresponding quotient metric in R”/ 7 provided the distance between A, A' 
in the quotient metric equals ||A — A'|| (in particular if one of the endpoints is 0). 
Thus we may have equality in the first inequality in (10) only if 71 is a segment. 
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i.e. if (after a change of parameter, if necessary) the spectrnm of jit), or T,{'y)(t), 
eqnals (exp(zAif),... ,exp(iA„t)), where, as before, (iAi,... ,iXn) are eigenvalnes of 
X. Let i/ii,... be all distinct eigenvalnes of x, and let, for k = 1,... ,m and 
t G [ 0 , 1 ], Pkit) be the spectral projection of 7 (f) corresponding to the eigenvalne 
^k(t) = exp{ifikt) (since x was assnmed to be “small”, z/^’s are also distinct). If 
follows from elementary fnnctional calcnlns that Pk{t) is a continnons fnnction of t 
and, moreover, has the same smoothness (in t) as 7 (t). To prove that 7 = 70 we need 
to show that Pk{t) is constant in t for k = 1,... ,m. To this end, we need to analyze 
the eqnality case in the second ineqnality in ( 10 ), and for that we mnst go into the 
proof of Theorem 14.3 of [^. In onr setting and notation, it is shown there (p. 68 , 
eqnation (14.5)) that 

<(E( 7 )) < I' II W(*)ll* < I' liy(«)ll<i« = <(7). ( 10 ) 

where, for u G U{n), Pu is a (necessarilly contractive) orthogonal projection in M{n) 
onto the commntant of u (in onr case only the restriction of the projection to Q is 
relevant). By the strict convexity of || ■ ||, we have UPMaill < ||x|| nnless x is in the 
range of Pu and so the second ineqnality in ( 11 ) (and hence the second ineqnality in 
(10)) is strict nnless ['y'it),'j{t)] = 0 for amost all t G [0,1]. Now, 7 (f) being normal, 
■y'{t) mnst also commnte with the spectral projections of 7 (t), i.e. 

[ 7 '(f), Pk{t)] = 0 ioT k = 1,... ,m. ( 11 ) 

Since 7 (f) = YXjLi'>^jit)Pjit), (H) translates into 

m 

E Pk{t)] = 0 for fc = 1 ,..., m. ( 12 ) 

For (small) h > 0, choose v{h) G U{n) so that it conjngates k'j(t)Pj(t) and 
J2jLi k'jit)Pj(t + h) (it then also conjngates Pk(t) and Pkit + h) for fc = 1,..., m) 
and so that n(0) = I and v{h) depends “smoothly” on h. Such a “smooth” choice 
is possible wherever 7 is “smooth”: as we indicated earlier, (Pfc)^i is then also a 
“smooth” curve in the the ’’generalized Stieffel manifold” (i.e., the quotient of U{n) 
by the commntant of 7 (t); see section 5 for an elaboration of the concept and its 
framework) and so it can be locally lifted to an “equally smooth” curve in U{n). If 
y = yit) = then it is easily seen that 

Pkit) = [y,Pk{t)]ioT k = l,...,m (13) 

and so (13) translates into 

m 

[i/E i'j{t)Pj{t)\ G commntant of 7(t), 
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and so (dropping t, which is hxed, from the formulae), 

m mm 

[y, E = E Pk[y, Y. ^jPj]Pk, 

j=l k=l j=l 

which is easily seen to be 0. On the other hand, 

m mm 

[y, Y = Y Y(^j - ^k)PkyPj- 

j=i k=ij=i 

Since {PkM{n)Pj, k = 1,... ,m, j = 1,... ,m} form an orthogonal decomposition 
of M{n), the above double sum can be 0 only if all the terms are 0. Recalling 
that z/fc are distinct, we deduce that j ^ k implies PkyPj = 0. Hence [y,Pk(t)] = 
PjyPk — PkyPj = 0 and so, by (14), Pl.(t) = 0 for k = 1,... ,m. Since t was 
an arbitrary point in [ 0 , 1 ] at which 7 (f) was “smooth”, it follows that Pl.{t) = 0 for 
almost all t and so each Pk{t) is in fact constant, as required. This proves part (b). 
(c) This is immediate; by (b), the geodesic must be (up to a change of parameter) of 
the form 7(f) = ue^^,t G [0,1] with £(7) = ||x|| and = u~^v. Since the spectrum 
of u~^v does not contain — 1 , the last equation has unique solution xq such that 
||a^o||oo < TT, and any other solution xi verihes ||a;i|| > ||to||, thus leading to a strictly 
longer curve. 

Comments. (1) We need to clarify that the results presented in and used 
above were obtained under “piecewise assumptions, both for the curve and for 
(roughly speaking and in our notation) the distance function in (C”, || • ||)/5'„ applied 
to 2 ( 7 ). Such framework was good enough for [|I| (their analysis, even though it fails 
to identify all singular points of the latter “distance function” in the general case, can 
be easily patched by approximating a general unitarily invariant norm by a smooth 
one), but is insufficient in our context, particularly for settling the equality cases. As 
indicated before, one solution is to consider absolutely continuous curves to insure 
that variation of a function can be expressed in terms of its derivative; we tacitly 
used that property several times in our argument). Another “hne point” that makes 
the argument work is the fact that the “distance function in (C”', || • ||)/S'„”, being an 
minimum of a hnite number of convex functions, is differentiable almost everywhere 
(also when composed with an absolutely continuous curve) and, moreover, it has 
directional derivatives everywhere. 

(2) In the last part of the argument we did show that, under certain additional 
assumptions, if a curve 7 in U{n) verihes ['y(t),Y(t)] = 0, then it must be contained 
in a commutative subgroup. It appears likely that the argument carries over to a more 
general class of “sufficiently smooth” curves in U{n): what we seem to be exploiting 
is that the patern of equatities between eigenvalues is constant. 
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4. The “1-Complemented” Subgroups 


Let G = SO{m) or U{n) and Q (= so{n) or u{n)) - the Lie algebra of G. Let H 
be a connected Lie subgronp of G, Ti - the corresponding Lie snbalgebra of Q and 
M = G/H. In this section we concentrate on the case when G and M are endowed 
with metric strnctnres indnced by the operator norm, we will call the respective 
metrics by p and pm (for technical pnrposes, we may nse other nnitary ideal norms, 
though). The purpose of this section is to prove, in the above context, an estimate 
of type (2) for an abstract class of homogeneous spaces that contains C(n), SO{n) 
and the Grassmannians Gn,k- The argument will depend on a careful analysis of the 
exponential map exp \ Q ^ G and maps obtained from it; as an illustration we 
point out here that the following result from |T^ is an immediate consequence of the 
results from the preceding two sections. 


Theorem 7. If G = SO{n) or U{n) (endowed with the operator norm or the 
induced intrinsic metric p) and e G (0,2], then 

(-)"<A'(G,£)<(-)t 

e e 

where d is the (real) dimension of G and c and G are universal numerical constants. 

Proof. By Lemma 3 and the hrst assertion of Lemma 4, exp is a contractive surjective 
map from the (closed) ball of radius vr in ^ to G. Consequently, Lemma 2 applied 
with $ = exp and L = 1 and combined with the second inequality in (1) (or, more 
precisely, the upper stimate on N" given by Lemma 1) yields the upper estimate 
for N{G,e). The lower estimate is obtained similarly by applying Lemma 2 to <h = 
exp~^, L = 2.5 and K = {u & G : p{u,I) < 7r/4}, using the hrst inequality in 
(1) (or, again more precisely, the lower estimate on N given by Lemma 1), the last 
assertion of Lemma 4 and Proposition 6(a). (Proposition 6(a) is needed only for the 
dehnition of K and its use may be avoided here.) 

If M 7 ^ G, the approach will be similar, but the situation is (necessarily) more 
complicated. Let g : G — G/H = M be the quotient map and consider the short 
exact sequence 0—i>G—>-0 and the induced sequence of maps between 
the tangent spaces (at resp. I E H, I E G and H E M = G/H) 

^ ThM 0 (14) 


and so ThM can be identihed with the quotient space G/H] we mean by that iso- 
metrically identihed whenever all metric structures are induced by a given unitarily 
invariant norm. Since the derivative of the exponential map at 0 is the identity (in 
particular an isometry), we can realize that identihcation by the canonical factoriza¬ 
tion of the derivative of g o exp at 0 (which maps G to ThM and vanishes on H) 
through G/H. This shows that (at least small) neighbourhoods in M resemble balls 
in the normed space G/H and gives some heuristic evidence that inequalities of type 
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(2) may hold for M. However, for a proof of such an inequality one needs “uniform 
isomorphic” (rather than “inhnitesimal”) estimates, and we will obtain these under 
some additional technical assumptions. Since the additive structure on Q and the 
group structure on G are not intertwined by the exponential (or any other) map, it 
will be more convenient to identify Q/Ti. with X = (the orthogonal complement 
of 7-f in and to consider $ = g o exp|_;t., hoping that the direct sum df © Td = ^ is 
“well-behaving” with respect to the operator norm (or any other unitarily invariant 
norm that we may need to consider), which happens in many natural examples. This 
leads to our first invariant related to a homogeneous space. We set 


K{M) = \\Px\\ = \\I-Pnl (15) 

where Pe denotes the orthogonal projection onto E and || ■ || is calculated with respect 
to the operator norm. 

Before stating the results, we will present a simple but illuminating example which 
shows that, in general, the “linearization” of M of the type suggested above may work 
only on the “inhnitesimal” scale (i.e. only very small neighbourhoods are “equiva¬ 
lent” to balls in the tangent space), and which leads to one more invariant of M. Let 
G = U{n) and H = SU{n). It is then easily seen that M = U{n)/SU{n) is isometric 
to a circle of radius 1/n and so covering numbers of N{M, e) are “trivial” if c > n/n. 
(Since M is 1-dimensional, it is necessarilly “isotropic” and so there are neighbour¬ 
hoods resembling segments of size comparable to the diameter of M; in particular (2) 
still holds. However, one can also produce “nonisotropic” examples: consider e.g., 
Lf = {/} X SU{n — 1) C U{n) = G.) The reason for this phenomenon is that SU{n) 
(or (7Y, via the exponential map) is very “densely woven” into U{n). For example, 
^ 2 m/nj ^ SU{n) and —/|| < 27r/n (more precisely, /) = 27r/n 

by Proposition 6(a)), even though the shortest path connecting / and and 

contained in SU{n) is of length 27r(l — 1/n) (this follows from the proof of Propo¬ 
sition 6(a), the length in question must be > than the length of the shortest path 
connecting (—271 + 2n/n, 2n/n ,..., 2n/n) and 0 in i/^/Sn that is contained in the 
plane {(t^) G R” ■ J2xk = 0}; another way to express this is that = e* with 

X E H forces ||a;|| > 27r(l — 1/n)). To quantify the phenomenon we introduce the 
following concept. Given 6* > 0, we will say that a closed connected Lie snbgronp 
H oi G = U{n) (or SO{n)) is 6'-woven if whenever u E H satisfies p{u,I) < 0 {p is 
the intrinsic metric induced by the operator norm || ■ ||oo), then there exists x E 7i, 
||a^||oo < 71 such that u = e^. If M = G/H, we set 

9{M) = sup{6' >0 ; H is 6'-woven} = dist{I, H\exp{B-E,{7T)), (16) 

the distance being calculated using p. We then have 
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Theorem 8. In the notation above, assume that k{M) 
(0, diam M], 


N{M,e)<{ 


Cdiam M 
e 


d 


1. Then, for any e G 


where d is the (real) dimension of M, diamM is calculated with respect to pm, and 
C > 0 is a universal constant. Moreover, if e E (0,6'(M)/4], then 


N{M,e) > 


,c9(M). 


where c> is a universal constant. The last estimate holds also if k{M) > 1, but 
the constant c may then depend on k{M). 


Proof. As suggested earlier, the proof will involve applying Lemma 2 to the (properly 
restricted) map q o its inverse, where A is the orthogonal complement in 

Q oiTC (the Lie subalgebra of Q corresponding to the subgroup H), and will be based 
on two lemmas that follow. Given r > 0, let Bx{r) be the ball in X of radius r 
(with respect to the operator norm) and centered at the origin. We then have, in the 
notation of Theorem 8: 


Lemma 9. If k{M) = 1, then q{exp{Bx{diam M))) = M. 

Lemma 10. There exist positive constants A = \{k,{M)) and vq = ro(K(M)) such 
that if r = min{ro, 6'(M)/4} and x,x' G Bx{r), then 

PM{q{e'"),q{e^')) > A||a;-a;'||. 


Remarks, (i) Calculating 6{M) is not difficult, particularly when H is semisim¬ 
ple. Indeed, suppose 9{M) < n (clearly the maximal possible value) and let u G 
R\exp(R^(7r)) be such that u = e^, ||/i||oo > tt, while ||m — /||oo = 9{M) < tt, in 
particular u = e^ for some x E Q, ||a;||oo < tt. Since the commutants of u and x are 
the same, it follows that 9{M) is “witnessed” inside a torus T in G containing u; 
moreover, T may be assumed to contain the one-parameter semigroup : t E R} 
and to be such that T fl is maximal in H. Consequently, to determine 9{M) we 
only need to examine maximal tori in H and their extensions to maximal tori in G. 
This is particularly easy if H is semi-simple: all conhgurations of the tori in question 
are then related by conjugation, and since the metric we consider is invariant under 
conjugation, it suffices to check just one such configuration. Such an examination 
will also reveal that 9{M) = tt, should that be the case. 

(ii) Since diamU{n) = tt and q, being a quotient map, is a contraction, one always 
has diamM < tt. In any case, by Proposition 6, one can always calculate diamM 
by examining images of one-parameter semigroups of G under q. 

(iii) If M = Gn,k (the Grassmann manifold), one verihes directly that k{M) = 1 and 
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diamM = tt/2. The former follows from the fact that X consists of those matrices 
in Q (= u{n) or so{n)) that are of the form (cf. (3)) 

0 X 

-X* 0 


The latter is elementary and presumably known: for two (fc-dimensional) subspaces 
E, F of R"' (resp. C”)), pg„ F) is the largest of the main angles between E and 
F (see (4) and comments following it for the framework and e.g., |T^, p. 174 for the 


more precise analysis). Finally, it follows immediately from the previous remark that 
S{Gn,k) = TT (the maximal tori in H are also maximal in G). 

(iv) If k{M) = 1, one can take in Lemma 10 r = .12 and A = .4 and if, moreover, 
x' = 0, one can take r = 5/9 and A = .4; it follows that q o exp {Bx{5/9)) D {U G 
G/H : p{U,q{I)) < 2/9} and that q o exp“^ restricted to any of these two sets is 
Lipschitz with constant 4.5. 

(v) The proof gives X{t) and ro(t) to be of order 1/t. The argument would be slightly 
more efficient if we considered X as endowed with the quotient norm G/'H, which is 
more natural in the context. 

Assuming the two lemmas above. Theorem 8 is shown almost exactly as Theorem 
7: one applies Lemma 2, hrst with $ = g o exp|_:^., L = 1 and K = Bx{diamM) for 
the upper estimate and then with the inverse map restricted to K = q{exp{Bx{T))) 
and with L = for the lower estimate. (All the hne points are hidden in Lemmas 
9 and 10.) 


Proof of Lemma 9. We will show that, for every p G [2, cx)), setting 

Kp = {x E X : ||a;||p < rG^^diam M} 

(i.e. Kp is a ball in X of radius n^^^diamM in the Schatten Cp-norm || ■ ||p), we have 

q{exp{Kp)) = M; (17) 

the assertion of the Lemma will then follow by letting p —> cx). To this end, observe 
that since the R-linear orthogonal projection M{n) —^ is of norm one (with respect 
to any unitarily invariant norm), k{M) equals to the norm of the orthogonal projec¬ 
tion from M{n) onto X. Now, since {M{n), || ■ ||p) is a complex interpolation space 
between {M{n), || ■ ||oo) and {M{n), || • II 2 ), it follows that Px is also contractive with 
respect to the Cp-norm (more generally, of norm < k(M)^“^/p). Furthermore, since 
the (Fp-norm is strictly convex for p G (1, cxd), we conclude that 

y^X^Wy- PnvWp = \\Pxy\\p < Ibllp- (18) 

For clarity, we will denote by Mp the manifold M equipped with the quotient metric 
Pp^M induced by the Schatten Cp-norm. Note that since the operator norm and the 
Cp-norm differ by factor at the most, we have diamMp < n^^^diam M. Let 
gPl G Mp and let 7 be the shortest geodesic in Mp connecting H and gH, then 
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£( 7 ) < n^^^diam M. Let 7 be a transversal lifting of 7 to G, i.e. a curve in G such 
that g o 7 = 7 and £( 7 ) = ^( 7 ). Then of course 7 is a geodesic in G (with respect 
to the intrinsic metric Pp induced by the Gp-norm) and without loss of generality we 
may assume that the initial point of 7 is I. By Proposition 6, 7 must be (perhaps 
after a change of parameter) of the form 7 (t) = 0 < t < 1 for some y & Q and 

£( 7 ) = lli/llp < rd-^^diamM, and so (18) will follow if we show that y G X. Indeed, 
if that was not the case, (19) would imply that \\y — Pny\\p < \y\\p and so, for t > 0 
sufficiently small we would have 

PpMe'^H, H) < pp{e^y, < \\ty - tPny\\p < t\\y\\p 

and consequently 

^( 7 ) = PpMiH, gH) = Pp^m{.H, eyp) 

< pp,M{H,e*yH) + pp^M{e^^H,eyH) < t\\y\\p+ (1 - t)\\y\\p , 

a contradiction. This proves Lemma 9. 

Proof of Lemma 10. We need to show that if x,x' G Bx{r) and h G 7i, then 

A = p(e^',e^e^) > A||a;-a;'||. 

Since ||a; — x'\\ > p{e^',e^) = p{e~^e^',1) and A = p{e~^e ^'it is enough to 
consider h eH such that 

p(e^, -^) < (1 + -^)||a^ ~ x'W < (1 + A)2r < 4r. 

If r < 6{M)/A (or just 2(1 + A)r < 6{M)), it follows from the dehnition of 6{M) (i.e. 
(17)) that h eH may be further assumed to satisfy ||h||oo < ti, hence 

||h||oo = < (1 + A)||a; -x'\\ < (1 + A)2r < 4r. 

Now, by Lemma 4 and Lemma 5, 

A = p{e^',e^e^) > p{e^', e^e^P) 

> p(e*'“G2^ ga;+/i/2^ _ || ( 3 ;'^/i/2] || — || [x, h/2]|| 

> 4>{r + ||/i||/ 2 )||a; — x' — h\\ — 2 r||/i|| 

> 0 (r + j|/ij|/ 2 )j|a; — a;'|| — 2r\\h\\ 

> (0(r + (1 + A)r) — 2 r(l + A))||a; — x'\\ 

> (0(3r) — 4r)||a; — x'\\, 

where 0(-) is the function from Lemma 4. It is now clear from Lemma 4 that if r > 0 
is small enough, then 0(3r) — 4r > 0. A more careful calculation along the same lines 
shows that if r = .12, then A = .4 works (as indicated in Remark (iv) above). 
Finally, if k{M) > 1, we can only use ||a; — x' — h\\ > k{M)~^\\x — x'\\ in the 
fourth inequality in the preceding argument. This results in the last expression being 
(0(3r)/T;(M)“^ — 4r)||a; — a;'||, which yields the assertion of the Lemma with tq and A 
being of order k{M)~^. 
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5. Extensions and Other Tricks 


The scheme presented in the preceding section yields resonable estimates for cov¬ 
ering nnmbers N{M,e) (with respect to the metric indnced by the operator norm) 
of a homogeneons space M = G/M whenever £ < diamM or e < 6{M) (for the up¬ 
per and lower estimate respectively) and whenever k{M) is appropriately controlled. 
This leaves several cases and gray areas that are not covered. 

(i) The range 6{M) < e < diamM even if k{M) = 1 and, in general, a clarihcation of 
the role of the ratio diam M/9{M) (the lower and upper estimates differing roughly 
by [diamM/9{M)Y). 

(ii) The upper estimate whenever k[M) > 1, but still ’’under control”. 

(hi) The case when we do not control k[M). 

With regards to (i), a modification of the example that led to the definition of 9[M) 
suggested there [H = {1} x SU[n — 1) <Z U[n) = G) shows that it is possible for 
diamM and 9{M) to differ by a large factor (of order n in that case). Even though 
an analysis of such cases is imaginably possible, it would be clearly combinatorial 
and/or algebraic in nature and we do not attempt it here. 

Concerning (iii), it is also conceivable that the phenomenon of having k[M) “large” 
can be “dissected” and expressed in terms of combinatorial/algebraic invariants sug¬ 
gested above, but, again, in the examples motivating this work (see below) we have 
||P-^|| = 1 and hence k{M) < 2. 

This leaves the gap related to (ii): the examples with, say, 1 < k[M) < 2 do 
naturally occur and it would be nice to have, at least for that case, an upper estimate 
for covering numbers of M of the type [Gdiam M/eY- Unfortunately, we do not know 
how to settle that question in full generality. Instead, we present a “bag of tricks” that 
allow to handle various special cases. This,and some comments concering covering 
numbers relative to metrics generated by unitarily invariant norms other than the 
operator norm constitutes this section. 

The first observation is that trying to mimmick the proof of Lemma 9 in the case 
when k[M) > 1 one arrives at the following picture. Let Q Q ^ Q/'H be the 
quotient map, and consider the semi-norm p on ^ defined by p[x) = ||(5a:||oo- Let 
A:^/7^—s>^bea norm-preserving lifting of Q (in general nolinear). The argument 
mimmicking the proof of Lemma 9 connects then geodesics in M with “rays” in the 
range of A and we could give upper estimates for entropy of M if we were able to 
control entropy of the range of M (e.g. with respect to the semi-norm p). 

The two specihc subgroups of U{n), for which estimates for covering numbers 
of the respective homogeneous spaces are of interest from the point of view of free 
1^ , Remark 7.2 and |^), consist of unitaries of some C'*-subalgebras 


probability (cf. 
of M[n), namely 

(1) The “block-diagonal” algebra: the commutant of {Pi, P 2 ,..., Pm}, where P/s are 
orthogonal projections whose ranges form an orthogonal decomposition of C”. 
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(2) The “tensor-factor” algebra: if n = mk, identify C” with C™ ® and consider 
matrices of the form I ®x, x E M{k)-, these can be also thonght of as block matrices 
with m identical k x k blocks along the diagonal. 

Let us comment here that the homogeneous space obtained from (1) is a “gen¬ 
eralized Stieffel manifold” of “orthogonal frames” of subspaces of C” (or with 
given pattern of dimensions. In both cases (1) and (2) the subgroup H (resp. the Lie 
algebra consists of (all) unitaries (resp. skew-symmetric matrices having form (1) 
or (2)), and the conditional expectation is a norm one (with respect to any unitarily 
invariant norm) projection from Q, the Lie algebra of G = U{n), onto Ti, in partic¬ 
ular k{G/H) < 2. However, except for the case m = 2 in (1) (i.e. the Grassmann 
manifold) or (2), we have k{G/H) > 1. 

Concerning the other parameters, it is easily seen that in all cases 6{G/H) = 2 
and 71/2 < diamG/H < tt. Accordingly, Theorem 8 gives good lower estimates for 
the covering numbers of G/H and it remains to handle the upper ones. We will use 
the following (ad hoc) 

Theorem 11. Let a G (0,1/2]; let n, G (= U{n) or SO{n)), H, M = G/H, Q, H, 
and d be as before and assume that 

mm.{6{M),diamM,K{M)~^} > a. 

Furthermore, asuume that one of the following holds 

(a) dim H < {1 — a) dim G 

(b) H aets reducibly on C” (resp. R” and there is a reducing subspace E with 
an < dim E < {1 — a)n 

(c) H acts reducibly on C” (resp. R*^) and there is a reducing subspace E with 
dim E = k > an and such that the orthogonal decomposition C” = i? © E^ in¬ 
duces an isomorphism H ^ U{k) x Hq for some subgroup Hq of U{n — k) (resp. 
Ii^,SO{k),SO{n-k)). 

Then, for any e G {f),diamM], 

(-)“ < JV(M,e) < (F", 

e e 

where c, G > 0 are constants depending only on a. 

Corollary 12. If H C U{n) is the group of unitaries of a “block-diagonal” or 
“tensor-factor” algebra (described in (1) or (2)), then the assertion of Theorem 11 
holds (e.g., with constants corresponding to 0 = 1/3/. 

Proof. We may of course assume m > 2. In the case (2) the condition (b) of Theorem 
11 is always satished (with a = 1/3). The same is true in the case (1) except if one 
of the projections Pj is of rank > n/3, in which case (c) holds. 



METRIC ENTROPY OF HOMOGENEOUS SPACES 


19 


Remarks, (i) As was pointed out in [^], Remark 7.2, the estimates for covering 
numbers given by our Corollary 12 (the “block-diagonal” case) allow sharp free en¬ 
tropy and free entropy dimension estimates. Similarly, the “tensor-factor” case of 
the Corollary implies estimates for free entropy and free entropy dimension of certain 
generators of free product von Neumann algebras ( [pl|] ). 

(ii) The “block-diagonal” case of Corollary 12 implies estimates for covering numbers 
of some sets of matrices needed in [Q. 


Proof of Theorem 11. As observed earlier, it is enough to show the upper estimate. 

(a) The condition (a) is equivalent to dim M > a dim G. It follows from Theorem 

7 that, for any £ G (0,2], G (hence, by Lemma 2, M = G/H) admits an £-net of 
cardinality < ^ e > (5 and (a) holds, this does not exceed G(a,/3)‘^™^ 

(the real dimension). It follows now from Lemma 10 that the image of Bx{r) (where 
r = r{a)) contains a ball in M of radius ri = ri{a), and so the former and the 
latter admit, for £ < r, an £-net of cardinality < (1^)'^*™-^. Combining this with 
the preceding observation (applied to /? = ri) we get the required upper estimate. 

(b) The condition (b) implies (a) (with a{l — a) in place of a). 

(c) Since the arguments in the real and complex case are identical, we restrict the 

discussion to the latter. The condition (c) is not included in (b) only if dim E = k > 
(1 — a)n, in particular k > n/2. Let Hi C G he the subgroup of the form U{k) x {/} 
in the sense indicated in the condition (c), then Hi G H and so M = G/H is 
a quotient of Mi = G/Hi. Now Mi is isomorphic to Gn,k x U{n — k) (for sure 
Lipschitz isomorphic with constant 2 if the product metric on the latter is dehned 
in the ‘Too sense”), and so, by Theorems 7 and 8, it admits, for any £ G (0,2], an 
£-net of cardinality < (again, the real dimension). Since k > n/2 implies 

dim Ml < 2dim M, arguing as in (a) we obtain the assertion. 

In some applications (see e.g. |]^, [|^]) it is important to know the metric entropy 
of M equipped with a metric induced by unitarily invariant norms other than the 
operator norm. The scheme presented in this paper can be adapted to yield fairly 
sharp results in the general case. Indeed, Lemmas 4, 5 and 6 involve statements 
about generic unitarily invariant norms. Similarly, Lemma 10 and its proof carry 
over almost word by word to the case of an arbitrary unitarily invariant norm || ■ || 
once the parameters such as 6 and n are properly interpreted: the balls Bx{-) 

are to remain to be dehned by the operator norm ]| ■ ||oo, but 9{M) has to be the 
distance between I and i7\ exp(R->^(7r)) in the intrinsic metric on G induced by || • ||; 
k{M) may be calculated using ]| ■ || (which results in a quantity not larger than the 
one given by the operator norm, in particular k{M) = 1 if ]| ■ ]| is the Hilbert Schmidt 
norm). The “linearization” procedure can be then implemented and the problem 
is reduced to estimating covering numbers of balls in X in the operator norm with 
respect to || ■ ||. As indicated in section 2 (see Lemma 1 and the paragraph preceding 
it), there exist numerous tools for obtaining such estimates. In particular, in many 




20 


STANISLAW J. SZAREK 


natural cases (e.g. M = U{n), SO{n) or the Grassmann manifolds Gn,k), the volumes 
of a ball in X with respect to a unitarily invariant norm || ■ || and the inscribed operator 
norm ball differ by a factor C^, G - an universal constant (this is easily implied e.g., 
by classical facts from [|^, |^, cf. p. 162; see also 0 or |^), which allows to 
use Lemma 1 to show that logA^(M, e:) d \og{diamM/e) (and diamM ^ ||/|| if 
M = U{n) or SO{n), diamM ^ ||.P||, where P is an orthogonal projection of rank 
equal to min{fc,?7, — k} if M = Gn,k)] these cases have been worked out in 
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